Statistical Consistency of Ranking Methods in A Rank-Differentiable Probability Space
Abstract
This paper is concerned with the statistical consistency of ranking methods. Recently, it was proven that many commonly used pairwise ranking methods are inconsistent with the weighted pairwise disagreement loss (WPDL), which can be viewed as the true loss of ranking, even in a low-noise setting. This result is interesting but also surprising, given that pairwise ranking methods have been shown to be very effective in practice. In this paper, we argue that the aforementioned result might not be conclusive, depending on which assumptions are used. We introduce a new assumption, namely that the labels of the objects to rank lie in a rank-differentiable probability space (RDPS), and prove that the pairwise ranking methods become consistent with WPDL under this assumption. Notably, the RDPS assumption is not stronger than the low-noise setting, but similar to it. Our analysis provides theoretical justification for some previously unexplained empirical findings on pairwise ranking methods, helping to bridge the gap between theory and applications.
Similar resources
Statistical Inference for Incomplete Ranking Data: The Case of Rank-Dependent Coarsening
We consider the problem of statistical inference for ranking data, specifically rank aggregation, under the assumption that samples are incomplete in the sense of not comprising all choice alternatives. In contrast to most existing methods, we explicitly model the process of turning a full ranking into an incomplete one, which we call the coarsening process. To this end, we propose the concept ...
Efficiency distribution and expected efficiencies in DEA with imprecise data
Several methods have been proposed for ranking the decision-making units (DMUs) in data envelopment analysis (DEA) with imprecise data. Some methods have only used the upper bound efficiencies to rank DMUs. However, some other methods have considered both of the lower and upper bound efficiencies to rank DMUs. The current paper shows that these methods did not consider the DEA axioms and may be...
Rank Consistent Estimation: The DOP Case
The goal of an estimator is to approximate the unknown distribution of the language from its partial evidence. In this thesis, a rank consistent estimator is defined as an estimator that preserves the ranking frequencies of all the full parse trees in the treebank proved to be rank consistent with respect to the training treebank. The rank consistency property adopts Laplace’s Principle of Insu...
A tree-based ranking algorithm and approximation of the optimal ROC curve
Recursive partitioning methods are among the most popular techniques in machine-learning. It is the purpose of this paper to investigate how such an appealing methodology may be adapted to the bipartite ranking problem. In ranking, the goal pursued is global: the matter is to learn how to define an order on the whole feature space X , so that positive instances take up the top ranks with maximu...
A New Method for Ranking Distribution Companies with Several Scenarios Data by Using DEA/MADM
In Data Envelopment Analysis, uncertain data are the inseparable part of real models. Natural models usually deal with uncertain and probable data. Many researchers prioritize these kinds of data. For instance, they study fuzzy data, interval data, probabilistic models etc. In this article, we proposed a method in which the decision making units are uncertain in their inputs and outputs. In the...
Publication date: 2012